# Multi-Scenario Adaptation

TRELLIS Image Large
MIT
The image-conditioned version of TRELLIS is a large-scale 3D generation model capable of generating 3D content from images.
3D Vision English
Surn
80
0
TRELLIS Image Large Fork
MIT
TRELLIS is a large-scale 3D generation model that achieves scalable and versatile 3D content creation through structured 3D latent variables.
3D Vision English
larsquaedvlieg
393
0
Bge Large Zh V1.5 GGUF
MIT
BAAI/bge-large-zh-v1.5 is a Chinese sentence transformer model primarily used for feature extraction and sentence similarity calculation.
Text Embedding Chinese
mradermacher
536
1
Ade20k Semantic Eomt Large 512
MIT
This model is based on the paper 'Your ViT is Secretly an Image Segmentation Model' and is a Vision Transformer model for image segmentation tasks.
Image Segmentation
tue-mps
108
0
Light R1 14B DS GGUF
Apache-2.0
Light-R1-14B-DS is a 14B-parameter quantized large language model supporting text generation tasks, designed for efficient inference in resource-constrained environments.
Large Language Model
qihoo360
2,784
9
Huihui Ai.granite Vision 3.2 2b Abliterated GGUF
Granite Vision 3.2 2B Abliterated is a vision-language model focused on image-to-text conversion tasks.
Image-to-Text
DevQuasar
724
1
Skyreels V1 Hunyuan I2V HFIE
Other
SkyReels-V1-Hunyuan-I2V is a video generation model developed by SkyworkAI and based on Tencent's Hunyuan architecture; it generates video content from an input image guided by a text prompt.
Text-to-Video English
jbilcke-hf
21
4
70B L3.3 Mhnnn X1
A large language model fine-tuned from Llama-3.3-70B-Instruct, specializing in creative text generation and multi-task processing.
Large Language Model Transformers
Sao10K
150
7
Japanese Parler Tts Large Bate
Other
A Japanese text-to-speech model fine-tuned from parler-tts-large-v1, capable of generating high-quality Japanese speech.
Speech Synthesis Transformers Japanese
2121-8
114
17
Segmentation
MIT
This is an end-to-end speaker segmentation model for voice activity detection, overlap speech detection, and resegmentation tasks.
Audio Processing TensorBoard
salmanshahid
1,790
0
Smollm2 Prompt Enhance GGUF
Apache-2.0
SmolLM2-Prompt-Enhance is a text generation model fine-tuned from SmolLM2-135M-Instruct, focusing on prompt enhancement tasks.
Large Language Model Transformers English
mav23
621
1
Elizabeth Olsen Sdxl Flux
Other
A LoRA model built on the FLUX.1-dev foundation model, specializing in generating photorealistic images of Elizabeth Olsen (particularly as Marvel's Scarlet Witch).
Text-to-Image
Keltezaa
15
3
Stable Diffusion V1.5
Openrail
A latent diffusion model for text-to-image generation, supporting 512x512 resolution image generation
Image Generation
stablediffusiontutorials
1,291
5
Moondream Caption
Apache-2.0
A customized small vision model based on Moondream2, fine-tuned specifically for image caption generation tasks
Image-to-Text Transformers
wraps
108
9
Yolov10n
YOLOv10 is a real-time end-to-end object detection model proposed by Tsinghua University, known for its efficiency and accuracy.
Object Detection Safetensors
jameslahm
3,326
17
Yolov10b
YOLOv10 is a real-time end-to-end object detection model that offers a balance between efficient detection performance and accuracy.
Object Detection Transformers
onnx-community
14
1
Burp 7B
Other
BuRP is a versatile roleplay model designed for highly interactive engagement with users; it does not refuse user requests and strictly adheres to specific dialogue formats.
Large Language Model Transformers English
ChaoticNeutrals
21
16
Promcse Bert Base Zh
MIT
PromCSE is a supervised learning-based sentence embedding model specifically designed for calculating Chinese sentence similarity.
Text Embedding Transformers Chinese
hellonlp
1,761
5
Speecht5 Finetuned Zh TW
MIT
A speech processing model based on the SpeechT5 architecture, fine-tuned for Taiwanese Mandarin
Speech Synthesis Transformers
zongxiao
47
0
Car Brands Classification
Apache-2.0
A pre-trained image classification model based on the BEiT architecture, supporting Vietnamese labels, suitable for vision tasks
Image Classification Transformers Other
lamnt2008
19
3
Vivid
MIT
A model for generating prompts for Stable Diffusion models
Image Generation English
NoxiusEngine
26
3
SBERT JSNLI Base
This is a model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space for tasks such as sentence similarity calculation, clustering, and semantic search.
Text Embedding Transformers
MU-Kindai
343
0
Sdg Sentence Transformer
This is a model based on sentence-transformers that maps sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as sentence similarity calculation and semantic search.
Text Embedding Transformers
peter2000
13
0
S BlueBERT
This is a model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering and semantic search.
Text Embedding Transformers
menadsa
58
0
Bpr Gpl Webis Touche2020 Base Msmarco Distilbert Tas B
This is a model based on sentence-transformers that can map sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search.
Text Embedding Transformers
income
41
0
Cifar 10 Vgg Pretrained
Image classification model implemented with PyTorch, capable of recognizing multiple common object categories
Image Classification Transformers
amehta633
22
0
Voice Activity Detection
MIT
Voice activity detection model based on pyannote.audio 2.1, used to identify speech activity segments in audio
Speech Recognition
pyannote
7.7M
181
Koelectra Base V3 Generalized Sentiment Analysis
Apache-2.0
Korean sentiment analysis model based on KoELECTRA-v3, used to determine the positive or negative sentiment tendency of text.
Text Classification Transformers Korean
Copycats
3,459
9
Bertweet Base Emotion Analysis
English emotion analysis model trained on the EmoEvent corpus, built on the BERTweet architecture.
Text Classification Transformers English
finiteautomata
27.87k
15
Paraphrase MiniLM L6 V2
Apache-2.0
This is a sentence transformer model that maps sentences and paragraphs into a 384-dimensional dense vector space, suitable for tasks such as clustering or semantic search.
Text Embedding Transformers
DataikuNLP
38
0
© 2025 AIbase